[[bellman_equation|bellman equation]] - anagora.org

query:

↳ on the Internet: Google | DuckDuckGo | Searx | Wikipedia

↳ in this Agora ⬎

📚 node [[bellman_equation|bellman equation]]

Welcome! Nobody has contributed anything to 'bellman_equation|bellman equation' yet. You can:

Write something in the document below!
- There is at least one public document in every node in the Agora. Whatever you write in it will be integrated and made available for the next visitor to read and edit.
Write to the Agora from social media.
- If you follow Agora bot on a supported platform and include the wikilink [[bellman_equation|bellman equation]] in a post, the Agora will link it here and optionally integrate your writing.
Sign up as a full Agora user.
- As a full user you will be able to contribute your personal notes and resources directly to this knowledge commons. Some setup required :)

⥅ related node [[bellman_equation]]

⥅ node [[bellman_equation]] pulled by Agora

📓 garden/KGBicheno/Artificial Intelligence/Introduction to AI/Week 3 - Introduction/Definitions/Bellman_Equation.md by @KGBicheno

Bellman equation

Go back to the [[AI Glossary]]

In reinforcement learning, the following identity satisfied by the optimal Q-function:

The Q-function in reinforcement learning

Reinforcement learning algorithms apply this identity to create Q-learning via the following update rule:

The Bellman equation

Beyond reinforcement learning, the Bellman equation has applications to dynamic programming. See the Wikipedia entry for Bellman Equation.

📖 stoas

public document at doc.anagora.org/bellman_equation|bellman-equation
video call at meet.jit.si/bellman_equation|bellman-equation

⥱ context

← back
ai glossary

↑ pushing here
(none)

↓ pulling this
(none)

→ forward
(none)

🔎 full text search for 'bellman_equation|bellman equation'